Accession Number |
TCMCG075C00956 |
gbkey |
CDS |
Protein Id |
XP_017981929.1 |
Location |
complement(join(4224111..4224191,4224394..4224465,4224885..4224950,4225268..4225299,4225484..4225643,4226231..4226348,4226917..4227062,4227859..4228104,4228194..4228301,4228397..4228518,4229026..4229116,4229183..4229413,4229530..4229673,4229972..4230048,4230407..4230530,4230884..4231051,4231347..4231712)) |
Gene |
LOC18611408 |
GeneID |
18611408 |
Organism |
Theobroma cacao |
|
|
Length |
783 aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018126440.1 |
|
Definition |
PREDICTED: probable cytosolic oligopeptidase A [Theobroma cacao] |
CDS: ATGGTCAATGTCCTCATGGCTACTACTCGCTTCTCTCTCTCCCGCTCAGTCCATCCAATACCAAAGTTTTCTTCTCCTTCTCCCCATTTTACCCCCAAACTTTTGCGTAAATCCTATGCTTGTCCCCTCTGGTCTTCTTCTTTCTCCTTCTGCCTTGAGTCTTTTCACCACTCAACTTCTCCTTCTCTCTCTTTCTCTTCATTTTCTTCTTGCTCTTCTCTCTCCCCTCCTTCAATGGCTTCTTCAGCTTCTATTGATGAGAACATGGAGTCCAATCCTCTCTTGCAAGATTTCGACTTTCCACCTTTTGATGTTGTTGAAGCCAAGCATGTCAGGCCTGGGATTCGTGCCTTGTTGAAGAAACTCGAAAATGATTTGGATGAATTGGAGAAAACAGTGGAGCCATCGTGGCCAAAGTTGGTGGAGCCGTTGGAAAAGATTGTTGATCGGTTGACTGTTGTATGGGGAATGGTTAATCATCTTAAGTCTGTTAAAGATACAGCTGAGCTCCGTGCTGCCATTGAAGAAGTCCAGCCTGAAAAAGTGAAGTTTCAACTAAGATTGGGACAAAGTAAACCCATCTACAATGCTTTCAAGGCTATTAAAGAATCTCCTGATTGGCAATCACTGAGTGAAGCTCGCAAACGTATTGTAGAGACCCAGATAAAGGAAGCTGTTCTTAATGGTGTTTCACTTGAAGATGATAAAAGGGAACAGTTTAACAAAATTGAACAGGAGCTGGAGAGGCTGTCTCACAAATTTAGTGAGAATGTTTTGGATGCCACAAAAAAGTTTGAAAAGCTGATAACTGATAAGAAAGAAATCGAGGGTTTGCCAGCGACTGCTCTTGGGTTAGCTGCACAAACAGCAGTTTCTAAGGGGCATGAAAATGCTACTGCTGAGAACGGCCCGTGGATGATTACATTGGATGCTCCAAGTTTTATTTCTGTTATGCAACATGCTCGTAACCGTGCTTTGCGTGAGGAAGTCTACCGTGCTTATGTAACTCGGGCATCGAGTGGTGATTTGGATAATACGCCAATAATCAATCAGATATTGCAGCTTCGGTTGGAAAAGGCTAAGCTTCTCAATTACAAGAACTATGCTGAGGTAAGCATGGCAACCAAAATGGCTACTGTTAATAAAGCTGAGGAGCTATTAGAAAAGCTTCGGAGTGCTTCCTGGAATGCTGCTGTCCAAGATGTTGAAGACCTAAAAGATTACTCCAAGAGTCAAGGTGCACTAGAAGCTGATAATTTGAGCCATTGGGACATCAACTTCTGGAGTGAGAGGCTTCGTGAGTCAAAATACAACATCAATGAGGAAGAACTCCGGCCGTATTTCTCGTTTTCAAAGGTTATGGATGGCCTTTTCAACCTTGCTAAGACACTTTTTGGAATTGACATTGAGCCAGCTGATGGCCTGGCTCCTGTCTGGAACAAAGATGTCAGGTTCTATTGTGTCAAAGATTCTTCAGGTAGTCCAATTGCCTATTTTTATTTTGATCCATACTCTCGTCCATCAGAGAAAAGGGAAGGTGCATGGATGGATGAGGTTGTTTCTCGAAGTCATGTACTGTCAAGTAATGGTACCACTGCAAGGTTGCCTGTTGCCCATATGGTGTGCAATCAAACACCACCAGTTGGGGACAAGCCAAGCCTCATGACATTCCGTGAAGTTGAGACTGTCTTCCATGAATTTGGCCATGCACTTCAGCATATGCTGACCAAGCAAGATGAGGGTCTAGTTGCTGGCATTCGGGGGATTGAGTGGGATGCTGTTGAGTTGCCCTCTCAGTTCATGGAAAATTGGTGTTACCACAGGGAAACATTGATGAGCATTGCAAAGCATTATGAAACAGGGGAGACTCTCCCTGAGGAGGTGTACTTGAAGCTCCTTGCTGCAAGGACTTTCCGTGCTGGTTCTTTAAGTCTTCGTCAGCTTCGATTTGCTAGTGTTGATTTGGAGCTTCATACAAAATATATACCAGGTGGG
TCAGAATCTGTTTATGATGTTGATCAGAGAGTTTCCAAAAGAACACAAGTGATTCCCCCATTGCCAGAAGATAGGTTCCTCTGTGGTTTCAACCATATATTTGCAGGTGGATATGCTGCTGGATATTACAGTTACAAGTGGGCAGAAGTGTTGTCTGCAGATGCTTTCTCAGCATTTGAGGATGCTGGATTGGAAGACAGCAAGGCTGTTAAAGAAACTGGCCACAAGTTCCGGGAGACCATTCTTGCTCTTGGAGGTGGAAAAGCACCATTAGAGGTCTTTGTTGAATTCCGTGGACGTGAACCTTCACCAGCGGCATTGCTCAGGCACAATGGATTGTTACCAGTCACAGCCTGA |
Protein: MVNVLMATTRFSLSRSVHPIPKFSSPSPHFTPKLLRKSYACPLWSSSFSFCLESFHHSTSPSLSFSSFSSCSSLSPPSMASSASIDENMESNPLLQDFDFPPFDVVEAKHVRPGIRALLKKLENDLDELEKTVEPSWPKLVEPLEKIVDRLTVVWGMVNHLKSVKDTAELRAAIEEVQPEKVKFQLRLGQSKPIYNAFKAIKESPDWQSLSEARKRIVETQIKEAVLNGVSLEDDKREQFNKIEQELERLSHKFSENVLDATKKFEKLITDKKEIEGLPATALGLAAQTAVSKGHENATAENGPWMITLDAPSFISVMQHARNRALREEVYRAYVTRASSGDLDNTPIINQILQLRLEKAKLLNYKNYAEVSMATKMATVNKAEELLEKLRSASWNAAVQDVEDLKDYSKSQGALEADNLSHWDINFWSERLRESKYNINEEELRPYFSFSKVMDGLFNLAKTLFGIDIEPADGLAPVWNKDVRFYCVKDSSGSPIAYFYFDPYSRPSEKREGAWMDEVVSRSHVLSSNGTTARLPVAHMVCNQTPPVGDKPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIEWDAVELPSQFMENWCYHRETLMSIAKHYETGETLPEEVYLKLLAARTFRAGSLSLRQLRFASVDLELHTKYIPGGSESVYDVDQRVSKRTQVIPPLPEDRFLCGFNHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLEDSKAVKETGHKFRETILALGGGKAPLEVFVEFRGREPSPAALLRHNGLLPVTA |
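
The Location field and the Length field above can be cross-checked against each other. The following is a minimal sketch, not part of the record itself: the helper names `parse_location` and `spliced_length` are hypothetical, and the code assumes a simple `complement(join(...))` location with no partial-end markers (`<` or `>`) and a CDS that includes its stop codon.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(4224111..4224191,4224394..4224465,4224885..4224950,"
    "4225268..4225299,4225484..4225643,4226231..4226348,4226917..4227062,"
    "4227859..4228104,4228194..4228301,4228397..4228518,4229026..4229116,"
    "4229183..4229413,4229530..4229673,4229972..4230048,4230407..4230530,"
    "4230884..4231051,4231347..4231712))"
)


def parse_location(location):
    """Return (strand, segments) for a simple GenBank-style location.

    Hypothetical helper: handles only plain start..end ranges, optionally
    wrapped in join(...) and complement(...).
    """
    strand = "-" if location.strip().startswith("complement(") else "+"
    segments = [(int(start), int(end))
                for start, end in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def spliced_length(segments):
    """Sum of segment lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)


strand, segments = parse_location(LOCATION)
cds_nt = spliced_length(segments)
# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_aa = cds_nt // 3 - 1

print(strand, len(segments), cds_nt, protein_aa)  # prints: - 17 2352 783
```

Under these assumptions, the 17 exons on the complement strand add up to 2352 nt, that is 784 codons including the stop, which agrees with the stated protein length of 783 aa.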